The Necessity of Average Rewards in Cooperative Multirobot Learning
نویسندگان
چکیده
Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular singlerobot learning algorithms based on discounted rewards, such as Q learning, do not achieve cooperation (i.e., purposeful division of labor) when applied to task-level multirobot systems. A tasklevel system is defined as one performing a mission that is decomposed into subtasks shared among robots. In this paper, we demonstrate the superiority of average-reward-based learning such as the Monte Carlo algorithm for task-level multirobot systems, and suggest an explanation for this superiority.
منابع مشابه
Reward and Diversity in Multirobot Foraging
This research seeks to quantify the impact of the choice of reward function on behavioral diversity in learning robot teams The methodology developed for this work has been applied to multirobot forag ing soccer and cooperative movement This paper focuses speci cally on results in multirobot forag ing In these experiments three types of reward are used with Q learning to train a multirobot team...
متن کاملCrucial factors affecting cooperative multirobot learning
Cooperative decentralized multirobot learning refers to the use of multiple learning entities to learn optimal solutions for an overall multirobot system. We demonstrate that traditional single-robot learning theory can be successfully used with multirobot systems, but only under certain conditions. The success and the effectiveness of single-robot learning algorithms in multirobot systems are ...
متن کاملCrucial Factors in Cooperative Multirobot Learning
Cooperative decentralized multirobot learning refers to the use of multiple learning entities to learn optimal solutions for an overall multirobot system. We demonstrate that traditional single-robot learning theory can be successfully used with multirobot systems, but only under certain conditions. The success and the effectiveness of single-robot learning algorithms in multirobot systems are ...
متن کاملIncrease In Activity And Learning Outcomes In Pharmacy Mathematics With Jigsaw Cooperative Learning Model At Pharmacy Academy Of Dwi Farma
Introduction: In Pharmacy Diploma Program, mathematics is known as pharmaceutical mathematics. Due to the importance of pharmaceutical mathematics in practice, it is important to have a basic mathematical skill as a basis in calculations in pharmaceutical science. Therefore, it is necessary to create a lecturing condition that enables students more active in understanding the lessons. This rese...
متن کاملThe effect of constructivist-based approach of teaching in science Courses on cooperative learning of Secondary school students and its sustainability over time
Introduction: The results of international research evaluating academic achievement, which studies the process of teaching experimental sciences, have shown that Iran’s rank is lower than average results. Therefore, the special attention to the course of experimental sciences is the essential and obvious need. In this regard, the purpose of this study was to investigate the effect of teaching...
متن کامل